Effective searching of RDF knowledge bases
Abstract
RDF data has become a vital source of information for many applications. In this thesis, we present a set of models and algorithms to effectively search large RDF knowledge bases. These knowledge bases contain a large set of subject-predicate-object (SPO) triples, where subjects and objects are entities and predicates express relationships between them. Such knowledge bases can be searched using the W3C-endorsed SPARQL language or similarly designed triple-pattern search. However, the exact-match semantics of triple-pattern search may fail to satisfy users' needs by returning too many or too few results. Thus, IR-style searching and ranking techniques are crucial. This thesis develops models and algorithms to enhance triple-pattern search. We propose a keyword extension to triple-pattern search that allows users to augment triple-pattern queries with keyword conditions. To improve the recall of triple-pattern search, we present a framework that automatically reformulates triple-pattern queries so that the intention of the original user query is preserved while a sufficient number of ranked results is returned. For efficient query processing, we present a set of top-k query processing algorithms, and for ease of use, we develop methods for plain keyword search over RDF knowledge bases. Finally, we propose a set of techniques to diversify query results, and we present several methods that allow users to interactively explore RDF knowledge bases to find additional contextual information about their query results.
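The exact-match behavior described in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's implementation: the toy knowledge base, the `?`-prefixed variable convention, and the helper names `match_pattern` and `evaluate` are assumptions made for illustration only. The sketch evaluates a conjunction of triple patterns by naive nested matching and shows how one missing triple removes an otherwise plausible answer.

```python
from typing import Dict, List, Optional, Tuple

Triple = Tuple[str, str, str]
Binding = Dict[str, str]

# Toy knowledge base (illustrative data, not taken from the thesis).
KB: List[Triple] = [
    ("Albert_Einstein", "wonPrize", "Nobel_Prize_in_Physics"),
    ("Albert_Einstein", "bornIn", "Ulm"),
    ("Max_Planck", "wonPrize", "Nobel_Prize_in_Physics"),
    ("Max_Planck", "bornIn", "Kiel"),
    ("Ulm", "locatedIn", "Germany"),
    # Note: no ("Kiel", "locatedIn", "Germany") triple is present.
]


def match_pattern(pattern: Triple, triple: Triple,
                  binding: Binding) -> Optional[Binding]:
    """Try to extend `binding` so that `pattern` matches `triple` exactly."""
    extended = dict(binding)
    for p_term, t_term in zip(pattern, triple):
        if p_term.startswith("?"):
            # Variable: bind it, or check consistency with an earlier binding.
            if extended.setdefault(p_term, t_term) != t_term:
                return None
        elif p_term != t_term:
            # Constant: must match exactly.
            return None
    return extended


def evaluate(patterns: List[Triple], kb: List[Triple]) -> List[Binding]:
    """Evaluate a conjunction of triple patterns by naive nested matching."""
    bindings: List[Binding] = [{}]
    for pattern in patterns:
        next_bindings: List[Binding] = []
        for binding in bindings:
            for triple in kb:
                extended = match_pattern(pattern, triple, binding)
                if extended is not None:
                    next_bindings.append(extended)
        bindings = next_bindings
    return bindings


# Physics Nobel laureates born in a German city.
strict_query: List[Triple] = [
    ("?x", "wonPrize", "Nobel_Prize_in_Physics"),
    ("?x", "bornIn", "?city"),
    ("?city", "locatedIn", "Germany"),
]
# The same query with the last pattern dropped by hand.
relaxed_query = strict_query[:2]

strict_results = evaluate(strict_query, KB)
relaxed_results = evaluate(relaxed_query, KB)
```

Here `strict_results` contains only the Einstein binding, because the toy data lacks a `locatedIn` triple for Kiel, while `relaxed_results` contains both laureates. Dropping a pattern by hand is only a stand-in for the automatic, ranked query reformulation that the abstract describes.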
Similar resources
Mapping of OWL Ontology Concepts to RDB Schemas
Modern technologies of Semantic Web, the growing complexity of information systems, the needs of knowledge bases and smart Web agents require conceptual models to gain an improved form of semantic knowledge models – i.e. ontology. Currently, the main technique of storing ontology is based on files containing descriptions of ontology in RDF/RDFS or OWL. On the other hand, the relational database...
RDF2Rules: Learning Rules from RDF Knowledge Bases by Mining Frequent Predicate Cycles
Recently, several large-scale RDF knowledge bases have been built and applied in many knowledge-based applications. To further increase the number of facts in RDF knowledge bases, logic rules can be used to predict new facts based on the existing ones. Therefore, how to automatically learn reliable rules from large-scale knowledge bases becomes increasingly important. In this paper, we propose ...
Efficient Management of Spatial RDF Data
The RDF data model has recently been extended to support representation and querying of spatial information (i.e., locations and geometries), which is associated with RDF entities. Still, there are limited efforts towards extending RDF stores to efficiently support spatial queries, such as range selections (e.g., find entities within a given range) and spatial joins (e.g., find pairs of entitie...
Knowledge Enhanced Searching on the Web
The move towards a semantic web has been in progress for many years and more recently there have been applications that make use of semantic web technology. One of the features that made the Web so easy to use is the ability to search web pages in a matter of seconds through the use of search engines. Now that the use of OWL and RDF as a knowledge representation format is increasing, the poss...
Accelerating Rule-Based Reasoning in Disk-Resident RDF Knowledge Bases
Collections of tens of millions of automatically extracted facts represented using the subject-predicate-object RDF model are available for several domains. As big as these collections are, they are unable to capture all information about a domain, simply because the sources from which they were extracted are incomplete. This can be tackled by creating knowledge bases where facts are enforced w...